Automated patch assessment for program repair at scale
نویسندگان
چکیده
Abstract In this paper, we do automatic correctness assessment for patches generated by program repair systems. We consider the human-written patch as ground truth oracle and randomly generate tests based on it, a technique proposed Shamshiri et al., called Random testing with Ground Truth (RGT) in paper. build curated dataset of 638 Defects4J 14 state-of-the-art systems, evaluate automated dataset. The results study are novel significant: First, improve state art performance RGT 190% improving oracle; Second, show that is reliable enough to help scientists overfitting analysis when they systems; Third, external validity knowledge largest ever.
منابع مشابه
Context-Aware Patch Generation for Better Automated Program Repair
The effectiveness of search-based automated program repair is limited in the number of correct patches that can be successfully generated. There are two causes of such limitation. First, the search space does not contain the correct patch. Second, the search space is huge and therefore the correct patch cannot be generated (i.e., correct patches are either generated after incorrect plausible on...
متن کاملDagstuhl Seminar 17022 Automated Program Repair
This report documents the program and the outcomes of Dagstuhl Seminar 17022 “Automated Program Repair”. The seminar participants presented and discussed their research through formal and informal presentations. In particular, the seminar covered work related to search-based program repair, semantic program repair, and repair of non-functional properties. As a result of the seminar, several par...
متن کاملProblems Before Solutions: Automated Assessment of Problem Understanding at Scale
Automatic assessment reduces the need for individual feedback in massive courses, but often focuses only on scoring solutions, rather than assessing whether students correctly understand problems. We present an enriched approach to automatic assessment that explicitly assists students in understanding the detailed specification of technical problems that they are asked to solve, in addition to ...
متن کاملAutomated Clustering and Program Repair for Introductory Programming Assignments
Providing feedback on programming assignments is a tedious task for the instructor, and even impossible in large MOOCs with thousands of students. In this paper, we present a novel technique for automatic feedback generation: (1) For a given programming assignment, we automatically cluster the correct student attempts based on their underlying strategy. From each class we select one student att...
متن کاملIdentifying Patch Correctness in Test-Based Program Repair
Test-based automatic program repair has attracted a lot of attention in recent years. However, the test suites in practice are often too weak to guarantee correctness and existing approaches often generate a large number of incorrect patches. To reduce the number of incorrect patches generated, we propose a novel approach that heuristically determines the correctness of the generated patches. T...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Empirical Software Engineering
سال: 2021
ISSN: ['1382-3256', '1573-7616']
DOI: https://doi.org/10.1007/s10664-020-09920-w